Added temporary fix for channels-last-like tensors for Resize op #4979

vfdev-5 · 2021-11-23T15:39:45Z

Description:

Added temporary fix for channels-last-like tensors for Resize op

Benchmarks: code

Upsampling 64 -> 224, x2 | x3 speed up for PTH->DType->Resize (64->224) ->Norm

Torch version: 1.11.0.dev20211123+cu111
Torchvision version: 0.12.0a0
PIL version: 8.4.0

# Master
[----------- Benchmark reader+transformation -----------]
                                                |   Time 
1 threads: ----------------------------------------------
      PIL->Resize (64->224) ->Tensor->Norm      |  1410.6
      PIL->Tensor->Resize (64->224) ->Norm      |   847.1
      PTH->DType->Resize (64->224) ->Norm       |  1574.1
      JIT: PTH->DType->Resize (64->224) ->Norm  |  1502.3
6 threads: ----------------------------------------------
      PIL->Resize (64->224) ->Tensor->Norm      |  1085.2
      PIL->Tensor->Resize (64->224) ->Norm      |   610.7
      PTH->DType->Resize (64->224) ->Norm       |  1314.2
      JIT: PTH->DType->Resize (64->224) ->Norm  |  1225.7

# This PR
[----------- Benchmark reader+transformation -----------]
                                                |   Time 
1 threads: ----------------------------------------------
      PIL->Resize (64->224) ->Tensor->Norm      |  1408.1
      PIL->Tensor->Resize (64->224) ->Norm      |   843.9
      PTH->DType->Resize (64->224) ->Norm       |   706.1
      JIT: PTH->DType->Resize (64->224) ->Norm  |   616.0
6 threads: ----------------------------------------------
      PIL->Resize (64->224) ->Tensor->Norm      |  1096.2
      PIL->Tensor->Resize (64->224) ->Norm      |   606.2
      PTH->DType->Resize (64->224) ->Norm       |   462.8
      JIT: PTH->DType->Resize (64->224) ->Norm  |   367.7

Times are in microseconds (us).

Notations:

PIL->Resize (64->224) ->Tensor->Norm = pil_loader, [T.Resize((osize, osize)), T.ToTensor(), T.Normalize(mean, std)]
PTH->DType->Resize (64->224) ->Norm = torch_loader, [T.ConvertImageDtype(torch.float), T.Resize((osize, osize)), T.Normalize(mean, std)]

Downsampling 500 -> 224, ~12% | ~25% speed up for PTH->DType->Resize (500->224) ->Norm

Torch version: 1.11.0.dev20211123+cu111
Torchvision version: 0.12.0a0
PIL version: 8.4.0

# Master
[---------- Benchmark reader+transformation -----------]
                                                 |  Time
1 threads: ---------------------------------------------
      PIL->Resize (500->224) ->Tensor->Norm      |  7.0 
      PIL->Tensor->Resize (500->224) ->Norm      |  7.9 
      PTH->DType->Resize (500->224) ->Norm       |  4.3 
      JIT: PTH->DType->Resize (500->224) ->Norm  |  4.2 
6 threads: ---------------------------------------------
      PIL->Resize (500->224) ->Tensor->Norm      |  7.1 
      PIL->Tensor->Resize (500->224) ->Norm      |  6.8 
      PTH->DType->Resize (500->224) ->Norm       |  3.8 
      JIT: PTH->DType->Resize (500->224) ->Norm  |  3.6 

# This PR
[---------- Benchmark reader+transformation -----------]
                                                 |  Time
1 threads: ---------------------------------------------
      PIL->Resize (500->224) ->Tensor->Norm      |  7.0 
      PIL->Tensor->Resize (500->224) ->Norm      |  7.9 
      PTH->DType->Resize (500->224) ->Norm       |  3.8 
      JIT: PTH->DType->Resize (500->224) ->Norm  |  3.7 
6 threads: ---------------------------------------------
      PIL->Resize (500->224) ->Tensor->Norm      |  7.0 
      PIL->Tensor->Resize (500->224) ->Norm      |  6.8 
      PTH->DType->Resize (500->224) ->Norm       |  3.0 
      JIT: PTH->DType->Resize (500->224) ->Norm  |  3.0

Times are in milliseconds (ms).

Notations:

PIL->Resize (500->224) ->Tensor->Norm = pil_loader, [T.Resize((osize, osize)), T.ToTensor(), T.Normalize(mean, std)]
PTH->DType->Resize (500->224) ->Norm = torch_loader, [T.ConvertImageDtype(torch.float), T.Resize((osize, osize)), T.Normalize(mean, std)]

PS: to avoid reverting this PR, let's wait for Francisco's review.

facebook-github-bot · 2021-11-23T15:39:52Z

💊 CI failures summary and remediations

As of commit 93522e4 (more details on the Dr. CI page):

1/1 failures introduced in this PR

🕵️ 1 new failure recognized by patterns

The following CI failures do not appear to be due to upstream breakages:

unittest_linux_cpu_py3.8 (1/1)

Step: "Run tests" (full log | diagnosis details | 🔁 rerun)

/root/project/torchvision/io/video.py:399: Runt...log: [mov,mp4,m4a,3gp,3g2,mj2] moov atom not found


test/test_internet.py::TestDatasetUtils::test_download_url_retry_http
  /root/project/env/lib/python3.8/email/feedparser.py:475: ResourceWarning: unclosed <ssl.SSLSocket [closed] fd=13, family=AddressFamily.AF_INET, type=SocketKind.SOCK_STREAM, proto=6>
    for lineno, line in enumerate(lines):

test/test_internet.py::TestDatasetUtils::test_download_url_dispatch_download_from_google_drive
  /root/project/env/lib/python3.8/unittest/mock.py:2030: ResourceWarning: unclosed <ssl.SSLSocket [closed] fd=13, family=AddressFamily.AF_INET, type=SocketKind.SOCK_STREAM, proto=6>
    setattr(_type, entry, MagicProxy(entry, self))

test/test_io.py::TestVideo::test_read_video_timestamps_corrupted_file
  /root/project/torchvision/io/video.py:399: RuntimeWarning: Failed to open container for /tmp/tmpjwjajg42.mp4; Caught error: [Errno 1094995529] Invalid data found when processing input: '/tmp/tmpjwjajg42.mp4'; last error log: [mov,mp4,m4a,3gp,3g2,mj2] moov atom not found
    warnings.warn(msg, RuntimeWarning)

test/test_models.py::test_memory_efficient_densenet[densenet121]
test/test_models.py::test_memory_efficient_densenet[densenet169]
test/test_models.py::test_memory_efficient_densenet[densenet201]
test/test_models.py::test_memory_efficient_densenet[densenet161]
  /root/project/env/lib/python3.8/site-packages/torch/autocast_mode.py:162: UserWarning: User provided device_type of 'cuda', but CUDA is not available. Disabling
    warnings.warn('User provided device_type of \'cuda\', but CUDA is not available. Disabling')

test/test_models.py::test_inception_v3_eval

This comment was automatically generated by Dr. CI (expand for details).

Please report bugs/suggestions to the (internal) Dr. CI Users group.

Click here to manually regenerate this comment.

Fixes pytorch#4880 Description: - Added temporary fix for channels-last-like tensors for Resize op Benchmarks:

datumbox

I would be in favour of merging this to resolve the speed issue until there are highly optimized kernels that handle channels last. This is related to pytorch/pytorch#83840 and #6465

cc @NicolasHug

NicolasHug · 2022-08-22T16:14:32Z

I think it's worth getting clarity on whether pytorch/pytorch#83840 is a fix-worthy issue first. Fixing pytorch/pytorch#83840 might make this PR obsolete.

This PR seems very similar to #4898 which was reverted from @fmassa 's #4898 (comment):

This was here on purpose for speed improvements on the reading (avoids copy).

I would revert this PR, and work on improving the performance on the transforms instead so that they better handle channel_last format

that still sounds applicable here?

vfdev-5 requested a review from fmassa November 23, 2021 15:39

pytorch-probot bot added the ciflow/default label Nov 23, 2021

facebook-github-bot added the cla signed label Nov 23, 2021

Added temporary fix for channels-last-like tensors for Resize op

93522e4

Fixes pytorch#4880 Description: - Added temporary fix for channels-last-like tensors for Resize op Benchmarks:

vfdev-5 force-pushed the fix-4880-slow-resize branch from 76a3458 to 93522e4 Compare November 23, 2021 15:40

datumbox reviewed Aug 22, 2022

View reviewed changes

Merge branch 'main' into fix-4880-slow-resize

14d63d9

pmeier removed the ciflow/default label Jun 2, 2023

bossjones mentioned this pull request Jun 10, 2024

pytorch accuracy bossjones/goob_ai#15

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added temporary fix for channels-last-like tensors for Resize op #4979

Added temporary fix for channels-last-like tensors for Resize op #4979

vfdev-5 commented Nov 23, 2021

facebook-github-bot commented Nov 23, 2021 •

edited

Loading

datumbox left a comment

NicolasHug commented Aug 22, 2022 •

edited

Loading

Added temporary fix for channels-last-like tensors for Resize op #4979

Are you sure you want to change the base?

Added temporary fix for channels-last-like tensors for Resize op #4979

Conversation

vfdev-5 commented Nov 23, 2021

facebook-github-bot commented Nov 23, 2021 • edited Loading

💊 CI failures summary and remediations

🕵️ 1 new failure recognized by patterns

unittest_linux_cpu_py3.8 (1/1)

datumbox left a comment

Choose a reason for hiding this comment

NicolasHug commented Aug 22, 2022 • edited Loading

facebook-github-bot commented Nov 23, 2021 •

edited

Loading

NicolasHug commented Aug 22, 2022 •

edited

Loading